SEQUENCE LISTING

<110> Lyamichev, Victor 

Skrzypczynski, Zbigniew 
Allawi, Hatim T. 
Wayland, Sarah R.
Takova, Tsetska 
Neri, Bruce P.

<120> Charge Tags and the Separation of Nucleic Acid Molecules 
<130> FORS-04912 
<140> 09/777,430 
<141> 2001-02-06 
<160> 85 

<170> PatentIn version 3.1
<210> 1 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic
<220>

<221> misc_feature
<222> (1)..(1)

<223> The residue at this position is linked to a spacer bearing a Cy3
group.

<220>

<221> misc_feature
<222> (1)..(2)

<223> The residues at these positions are amino-modified bases
<400> 1 

ttcttttcac cagcgagacg gg 22 
<210> 2 
<211> 23 
<212> DNA 



<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 2 

tattgggcgc cagggtggtt ttt 23 

<210> 3 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 3 

cagggtgaag ggaagaagaa agcgaaaggt 30 

<210> 4 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 4 

cagggggaag ggaagaagaa agcgaaaggt 30

<210> 5 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 5 

cacgaattcc gaggcgatgc ttccgctc 28






<210> 6 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 6 

tcgacgtcga ctaacccttg gcggaaagcc 30 
<210> 7 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 7 

gcatcgcctc ggaattcatg gtc 23 
<210> 8 
<211> 836 
<212> PRT 

<213> Thermus thermophilus 
<400> 8 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val
1 5 10 15

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu
20 25 30

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly
35 40 45

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala
50 55 60

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala
65 70 75 80

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro
85 90 95

Arg Gln Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Phe Thr
100 105 110

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu
115 120 125

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala
130 135 140

Asp Arg Asp Leu Tyr Gln Leu Val Ser Asp Arg Val Ala Val Leu His
145 150 155 160

Pro Glu Gly His Leu Ile Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly
165 170 175

Leu Arg Pro Glu Gln Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro
180 185 190

Ser Asp Asn Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Leu
195 200 205

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu
210 215 220

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys Ile Lys Ala His Leu
225 230 235 240

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu
245 250 255

Pro Leu Glu Val Asp Leu Ala Gln Gly Arg Glu Pro Asp Arg Glu Gly
260 265 270

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu
275 280 285

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro
290 295 300

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro
305 310 315 320

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val
325 330 335

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val
340 345 350

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly
355 360 365

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu
370 375 380

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly
385 390 395 400

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu
405 410 415

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp
420 425 430

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met
435 440 445

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gln Ala Leu Ser
450 455 460

Leu Glu Leu Ala Glu Glu Ile Arg Arg Leu Glu Glu Glu Val Phe Arg
465 470 475 480

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg
485 490 495

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gln Lys
500 505 510

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu
515 520 525

Ala His Pro Ile Val Glu Lys Ile Leu Gln His Arg Glu Leu Thr Lys
530 535 540

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg
545 550 555 560

Thr Gly Arg Leu His Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly
565 570 575

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr
580 585 590

Pro Leu Gly Gln Arg Ile Arg Arg Ala Phe Val Ala Glu Ala Gly Trp
595 600 605

Ala Leu Val Ala Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala
610 615 620

His Leu Ser Gly Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Lys
625 630 635 640

Asp Ile His Thr Gln Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu
645 650 655

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly
660 665 670

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ile
675 680 685

Pro Tyr Glu Glu Ala Val Ala Phe Ile Glu Arg Tyr Phe Gln Ser Phe
690 695 700

Pro Lys Val Arg Ala Trp Ile Glu Lys Thr Leu Glu Glu Gly Arg Lys
705 710 715 720

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp
725 730 735

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala
740 745 750

Phe Asn Met Pro Val Gln Gly Thr Ala Ala Asp Leu Met Lys Leu Ala
755 760 765

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu
770 775 780

Leu Gln Val His Asp Glu Leu Leu Leu Glu Ala Pro Gln Ala Arg Ala
785 790 795 800

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro
805 810 815

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu
820 825 830

Ser Ala Lys Gly
835

<210> 9 
<211> 2511 
<212> DNA 

<213> Thermus thermophilus 
<400> 9 



atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960

atgtgggcgg agcttaaagc cctggccgcc tgcagggacg gccgggtgca ccgggcagca 1020

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140

gcctacctcc tggacccctc caacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320

ctctcccggg tcctggccca catggaggcc accggggtac ggcgggacgt ggcctacctt 1380

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500

gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 1560

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340

gcccgcatgc tcctccaggt ccacgacgag ctcctcctgg aggcccccca agcgcgggcc 2400

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtta g 2511



<210> 10 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 10 

caggaggagc tcgttgtgga cctgga 26 

<210> 11 

<211> 836 

<212> PRT 

<213> Thermus thermophilus 




<400> 11 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val
1 5 10 15

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu
20 25 30

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly
35 40 45

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala
50 55 60

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala
65 70 75 80

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro
85 90 95

Arg Gln Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Phe Thr
100 105 110

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu
115 120 125

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala
130 135 140

Asp Arg Asp Leu Tyr Gln Leu Val Ser Asp Arg Val Ala Val Leu His
145 150 155 160

Pro Glu Gly His Leu Ile Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly
165 170 175

Leu Arg Pro Glu Gln Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro
180 185 190

Ser Asp Asn Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Leu
195 200 205

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu
210 215 220

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys Ile Lys Ala His Leu
225 230 235 240

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu
245 250 255

Pro Leu Glu Val Asp Leu Ala Gln Gly Arg Glu Pro Asp Arg Glu Gly
260 265 270

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu
275 280 285

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro
290 295 300

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro
305 310 315 320

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val
325 330 335

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val
340 345 350

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly
355 360 365

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu
370 375 380

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly
385 390 395 400

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu
405 410 415

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp
420 425 430

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met
435 440 445

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gln Ala Leu Ser
450 455 460

Leu Glu Leu Ala Glu Glu Ile Arg Arg Leu Glu Glu Glu Val Phe Arg
465 470 475 480

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg
485 490 495

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gln Lys
500 505 510

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu
515 520 525

Ala His Pro Ile Val Glu Lys Ile Leu Gln His Arg Glu Leu Thr Lys
530 535 540

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg
545 550 555 560

Thr Gly Arg Leu His Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly
565 570 575

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr
580 585 590

Pro Leu Gly Gln Arg Ile Arg Arg Ala Phe Val Ala Glu Ala Gly Trp
595 600 605

Ala Leu Val Ala Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala
610 615 620

His Leu Ser Gly Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Lys
625 630 635 640

Asp Ile His Thr Gln Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu
645 650 655

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly
660 665 670

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ile
675 680 685

Pro Tyr Glu Glu Ala Val Ala Phe Ile Glu Arg Tyr Phe Gln Ser Phe
690 695 700

Pro Lys Val Arg Ala Trp Ile Glu Lys Thr Leu Glu Glu Gly Arg Lys
705 710 715 720

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp
725 730 735

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala
740 745 750

Phe Asn Met Pro Val Gln Gly Thr Ala Ala Asp Leu Met Lys Leu Ala
755 760 765

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu
770 775 780

Leu Gln Val His Asn Glu Leu Leu Leu Glu Ala Pro Gln Ala Arg Ala
785 790 795 800

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro
805 810 815

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu
820 825 830

Ser Ala Lys Gly
835

<210> 12 

<211> 2511 

<212> DNA 

<213> Thermus thermophilus 
<400> 12 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960

atgtgggcgg agcttaaagc cctggccgcc tgcagggacg gccgggtgca ccgggcagca 1020

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140

gcctacctcc tggacccctc caacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320

ctctcccggg tcctggccca catggaggcc accggggtac ggcgggacgt ggcctacctt 1380

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500

gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 1560

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340

gcccgcatgc tcctccaggt ccacaacgag ctcctcctgg aggcccccca agcgcgggcc 2400

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtta g 2511
<210> 13

<211> 58 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 13 

tgcctgcagg tcgacgctag ctagtggtgg tggtggtggt gacccttggc ggaaagcc 58 

<210> 14 

<211> 2526 

<212> DNA 

<213> Thermus thermophilus 

<400> 14 



atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960

atgtgggcgg agcttaaagc cctggccgcc tgcagggacg gccgggtgca ccgggcagca 1020

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140

gcctacctcc tggacccctc caacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320

ctctcccggg tcctggccca catggaggcc accggggtac ggcgggacgt ggcctacctt 1380

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500

gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 1560

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340

gcccgcatgc tcctccaggt ccacaacgag ctcctcctgg aggcccccca agcgcgggcc 2400

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520

caccac 2526



<210> 15 

<211> 842 

<212> PRT 

<213> Thermus thermophilus 






<400> 15 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val
1 5 10 15

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu
20 25 30

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly
35 40 45

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala
50 55 60

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala
65 70 75 80

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro
85 90 95

Arg Gln Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Phe Thr
100 105 110

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu
115 120 125

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala
130 135 140

Asp Arg Asp Leu Tyr Gln Leu Val Ser Asp Arg Val Ala Val Leu His
145 150 155 160

Pro Glu Gly His Leu Ile Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly
165 170 175

Leu Arg Pro Glu Gln Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro
180 185 190

Ser Asp Asn Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Leu
195 200 205

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu
210 215 220

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys Ile Lys Ala His Leu
225 230 235 240

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu
245 250 255

Pro Leu Glu Val Asp Leu Ala Gln Gly Arg Glu Pro Asp Arg Glu Gly
260 265 270

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu
275 280 285

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro
290 295 300

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro
305 310 315 320

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val
325 330 335

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val
340 345 350

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly
355 360 365

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu
370 375 380

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly
385 390 395 400

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu
405 410 415

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp
420 425 430

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met
435 440 445

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gln Ala Leu Ser
450 455 460

Leu Glu Leu Ala Glu Glu Ile Arg Arg Leu Glu Glu Glu Val Phe Arg
465 470 475 480

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg
485 490 495

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gln Lys
500 505 510

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu
515 520 525

Ala His Pro Ile Val Glu Lys Ile Leu Gln His Arg Glu Leu Thr Lys
530 535 540

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg
545 550 555 560

Thr Gly Arg Leu His Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly
565 570 575

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr
580 585 590

Pro Leu Gly Gln Arg Ile Arg Arg Ala Phe Val Ala Glu Ala Gly Trp
595 600 605

Ala Leu Val Ala Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala
610 615 620

His Leu Ser Gly Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Lys
625 630 635 640

Asp Ile His Thr Gln Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu
645 650 655

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly
660 665 670

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ile
675 680 685

Pro Tyr Glu Glu Ala Val Ala Phe Ile Glu Arg Tyr Phe Gln Ser Phe
690 695 700

Pro Lys Val Arg Ala Trp Ile Glu Lys Thr Leu Glu Glu Gly Arg Lys
705 710 715 720

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp
725 730 735

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala
740 745 750

Phe Asn Met Pro Val Gln Gly Thr Ala Ala Asp Leu Met Lys Leu Ala
755 760 765

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu
770 775 780

Leu Gln Val His Asn Glu Leu Leu Leu Glu Ala Pro Gln Ala Arg Ala
785 790 795 800

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro
805 810 815

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu
820 825 830

Ser Ala Lys Gly His His His His His His
835 840



<210> 16

<211> 31

<212> DNA

<213> Artificial Sequence
<220>

<223> Synthetic

<400> 16

gcctgcaggg gcggccgcgt gcaccggggc 31

<210> 17

<211> 25

<212> DNA

<213> Artificial Sequence
<220>

<223> Synthetic

<400> 17

ctcctggacc cttcgaacac cacccc 26

<210> 18

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 18 

gtcctggccc atatggaggc cac 23 

<210> 19 

<211> 2526 

<212> DNA 

<213> Thermus thermophilus 

<400> 19 



atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500

gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 1560

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340

gcccgcatgc tcctccaggt ccacaacgag ctcctcctgg aggcccccca agcgcgggcc 2400

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520

caccac 2526



<210> 20 

<211> 842 

<212> PRT 

<213> Thermus thermophilus






<400> 20 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val
1 5 10 15

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu
20 25 30

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly
35 40 45

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala
50 55 60

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala
65 70 75 80

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro
85 90 95

Arg Gln Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Phe Thr
100 105 110

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu
115 120 125

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala
130 135 140

Asp Arg Asp Leu Tyr Gln Leu Val Ser Asp Arg Val Ala Val Leu His
145 150 155 160

Pro Glu Gly His Leu Ile Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly
165 170 175

Leu Arg Pro Glu Gln Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro
180 185 190

Ser Asp Asn Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Leu
195 200 205

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu
210 215 220

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys Ile Lys Ala His Leu
225 230 235 240

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu
245 250 255

Pro Leu Glu Val Asp Leu Ala Gln Gly Arg Glu Pro Asp Arg Glu Gly
260 265 270

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu
275 280 285

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro
290 295 300

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro
305 310 315 320

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val
325 330 335

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val
340 345 350

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly
355 360 365

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu
370 375 380

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly
385 390 395 400

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu
405 410 415

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp
420 425 430

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met
435 440 445

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gln Ala Leu Ser
450 455 460

Leu Glu Leu Ala Glu Glu Ile Arg Arg Leu Glu Glu Glu Val Phe Arg
465 470 475 480

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg
485 490 495

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gln Lys
500 505 510

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu
515 520 525

Ala His Pro Ile Val Glu Lys Ile Leu Gln His Arg Glu Leu Thr Lys
530 535 540

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg
545 550 555 560

Thr Gly Arg Leu His Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly
565 570 575

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr
580 585 590

Pro Leu Gly Gln Arg Ile Arg Arg Ala Phe Val Ala Glu Ala Gly Trp
595 600 605

Ala Leu Val Ala Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala
610 615 620

His Leu Ser Gly Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Lys
625 630 635 640

Asp Ile His Thr Gln Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu
645 650 655

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly
660 665 670

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ile
675 680 685

Pro Tyr Glu Glu Ala Val Ala Phe Ile Glu Arg Tyr Phe Gln Ser Phe
690 695 700

Pro Lys Val Arg Ala Trp Ile Glu Lys Thr Leu Glu Glu Gly Arg Lys
705 710 715 720

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp
725 730 735

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala
740 745 750

Phe Asn Met Pro Val Gln Gly Thr Ala Ala Asp Leu Met Lys Leu Ala
755 760 765

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu
770 775 780

Leu Gln Val His Asn Glu Leu Leu Leu Glu Ala Pro Gln Ala Arg Ala
785 790 795 800

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro
805 810 815

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu
820 825 830

Ser Ala Lys Gly His His His His His His
835 840

<210> 21 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 21 

caggaggagc tcgttggcga cctggaggag 30 
<210> 22 
<211> 2526 
<212> DNA 

<213> Thermus thermophilus 
<400> 22 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480
cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540
cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600
ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660
ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720
gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780
gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840
gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900
gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960
atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020
gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080
gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140
gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200
gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260
cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320
ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380
caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440
ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500
gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 1560
gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620
gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680
acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740
tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800
gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860
cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920
gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980
ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040
aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100
ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160
cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220
gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280
gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340
gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520
caccac 2526 
<210> 23 
<211> 842 
<212> PRT 

<213> Thermus thermophilus 
<400> 23 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val
1 5 10 15

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu
20 25 30

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly
35 40 45

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala
50 55 60

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala
65 70 75 80

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro
85 90 95

Arg Gln Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Phe Thr
100 105 110

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu
115 120 125

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala
130 135 140

Asp Arg Asp Leu Tyr Gln Leu Val Ser Asp Arg Val Ala Val Leu His
145 150 155 160

Pro Glu Gly His Leu Ile Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly
165 170 175

Leu Arg Pro Glu Gln Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro
180 185 190

Ser Asp Asn Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Leu
195 200 205

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu
210 215 220

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys Ile Lys Ala His Leu
225 230 235 240

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu
245 250 255

Pro Leu Glu Val Asp Leu Ala Gln Gly Arg Glu Pro Asp Arg Glu Gly
260 265 270

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu
275 280 285

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro
290 295 300

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro
305 310 315 320

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val
325 330 335

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val
340 345 350

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly
355 360 365

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu
370 375 380

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly
385 390 395 400

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu
405 410 415

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp
420 425 430

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met
435 440 445

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gln Ala Leu Ser
450 455 460

Leu Glu Leu Ala Glu Glu Ile Arg Arg Leu Glu Glu Glu Val Phe Arg
465 470 475 480

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg
485 490 495

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gln Lys
500 505 510

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu
515 520 525

Ala His Pro Ile Val Glu Lys Ile Leu Gln His Arg Glu Leu Thr Lys
530 535 540

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg
545 550 555 560

Thr Gly Arg Leu His Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly
565 570 575

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr
580 585 590

Pro Leu Gly Gln Arg Ile Arg Arg Ala Phe Val Ala Glu Ala Gly Trp
595 600 605

Ala Leu Val Ala Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala
610 615 620

His Leu Ser Gly Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Lys
625 630 635 640

Asp Ile His Thr Gln Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu
645 650 655

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly
660 665 670

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ile
675 680 685

Pro Tyr Glu Glu Ala Val Ala Phe Ile Glu Arg Tyr Phe Gln Ser Phe
690 695 700

Pro Lys Val Arg Ala Trp Ile Glu Lys Thr Leu Glu Glu Gly Arg Lys
705 710 715 720

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp
725 730 735

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala
740 745 750

Phe Asn Met Pro Val Gln Gly Thr Ala Ala Asp Leu Met Lys Leu Ala
755 760 765

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu
770 775 780

Leu Gln Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gln Ala Arg Ala
785 790 795 800

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro
805 810 815

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu
820 825 830

Ser Ala Lys Gly His His His His His His
835 840


<210> 24
<211> 45
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 24

ggagcgcttg cctgtcttct tcgtcttctt caaggcggga ggcct 45 

<210> 25 

<211> 2526 

<212> DNA 

<213> Thermus thermophilus 

<400> 25 



atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480
cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540
cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600
ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660
ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720
gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780
gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840
gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900
gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960
atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020
gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080
gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140
gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200
gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260
cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320
ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380
caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440
ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500



gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560
gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620
gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680
acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740
tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800
gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860
cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920
gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980
ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040
aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100
ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160
cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220
gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280
gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340
gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520
caccac 2526



<210> 26 
<211> 842 
<212> PRT 

<213> Thermus thermophilus 
<400> 26 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val
1 5 10 15

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu
20 25 30

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly
35 40 45

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala
50 55 60

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala
65 70 75 80

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro
85 90 95

Arg Gln Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Phe Thr
100 105 110

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu
115 120 125

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala
130 135 140

Asp Arg Asp Leu Tyr Gln Leu Val Ser Asp Arg Val Ala Val Leu His
145 150 155 160

Pro Glu Gly His Leu Ile Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly
165 170 175

Leu Arg Pro Glu Gln Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro
180 185 190

Ser Asp Asn Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Leu
195 200 205

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu
210 215 220

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys Ile Lys Ala His Leu
225 230 235 240

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu
245 250 255

Pro Leu Glu Val Asp Leu Ala Gln Gly Arg Glu Pro Asp Arg Glu Gly
260 265 270

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu
275 280 285

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro
290 295 300

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro
305 310 315 320

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val
325 330 335

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val
340 345 350

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly
355 360 365

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu
370 375 380

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly
385 390 395 400

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu
405 410 415

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp
420 425 430

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met
435 440 445

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gln Ala Leu Ser
450 455 460

Leu Glu Leu Ala Glu Glu Ile Arg Arg Leu Glu Glu Glu Val Phe Arg
465 470 475 480

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg
485 490 495

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys
500 505 510

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu
515 520 525

Ala His Pro Ile Val Glu Lys Ile Leu Gln His Arg Glu Leu Thr Lys
530 535 540

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg
545 550 555 560

Thr Gly Arg Leu His Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly
565 570 575

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr
580 585 590

Pro Leu Gly Gln Arg Ile Arg Arg Ala Phe Val Ala Glu Ala Gly Trp
595 600 605

Ala Leu Val Ala Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala
610 615 620

His Leu Ser Gly Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Lys
625 630 635 640

Asp Ile His Thr Gln Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu
645 650 655

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly
660 665 670

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ile
675 680 685

Pro Tyr Glu Glu Ala Val Ala Phe Ile Glu Arg Tyr Phe Gln Ser Phe
690 695 700

Pro Lys Val Arg Ala Trp Ile Glu Lys Thr Leu Glu Glu Gly Arg Lys
705 710 715 720

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp
725 730 735

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala
740 745 750

Phe Asn Met Pro Val Gln Gly Thr Ala Ala Asp Leu Met Lys Leu Ala
755 760 765

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu
770 775 780

Leu Gln Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gln Ala Arg Ala
785 790 795 800

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro
805 810 815

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu
820 825 830

Ser Ala Lys Gly His His His His His His
835 840

<210> 27 

<211> 340 

<212> PRT 

<213> Pyrococcus furiosus 
<400> 27 

Met Gly Val Pro Ile Gly Glu Ile Ile Pro Arg Lys Glu Ile Glu Leu
1 5 10 15

Glu Asn Leu Tyr Gly Lys Lys Ile Ala Ile Asp Ala Leu Asn Ala Ile
20 25 30

Tyr Gln Phe Leu Ser Thr Ile Arg Gln Lys Asp Gly Thr Pro Leu Met
35 40 45

Asp Ser Lys Gly Arg Ile Thr Ser His Leu Ser Gly Leu Phe Tyr Arg
50 55 60

Thr Ile Asn Leu Met Glu Ala Gly Ile Lys Pro Val Tyr Val Phe Asp
65 70 75 80

Gly Glu Pro Pro Glu Phe Lys Lys Lys Glu Leu Glu Lys Arg Arg Glu
85 90 95

Ala Arg Glu Glu Ala Glu Glu Lys Trp Arg Glu Ala Leu Glu Lys Gly
100 105 110

Glu Ile Glu Glu Ala Arg Lys Tyr Ala Gln Arg Ala Thr Arg Val Asn
115 120 125

Glu Met Leu Ile Glu Asp Ala Lys Lys Leu Leu Glu Leu Met Gly Ile
130 135 140

Pro Ile Val Gln Ala Pro Ser Glu Gly Glu Ala Gln Ala Ala Tyr Met
145 150 155 160

Ala Ala Lys Gly Ser Val Tyr Ala Ser Ala Ser Gln Asp Tyr Asp Ser
165 170 175

Leu Leu Phe Gly Ala Pro Arg Leu Val Arg Asn Leu Thr Ile Thr Gly
180 185 190

Lys Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Glu Ile Lys Pro Glu
195 200 205

Leu Ile Ile Leu Glu Glu Val Leu Lys Glu Leu Lys Leu Thr Arg Glu
210 215 220

Lys Leu Ile Glu Leu Ala Ile Leu Val Gly Thr Asp Tyr Asn Pro Gly
225 230 235 240

Gly Ile Lys Gly Ile Gly Leu Lys Lys Ala Leu Glu Ile Val Arg His
245 250 255

Ser Lys Asp Pro Leu Ala Lys Phe Gln Lys Gln Ser Asp Val Asp Leu
260 265 270

Tyr Ala Ile Lys Glu Phe Phe Leu Asn Pro Pro Val Thr Asp Asn Tyr
275 280 285

Asn Leu Val Trp Arg Asp Pro Asp Glu Glu Gly Ile Leu Lys Phe Leu
290 295 300

Cys Asp Glu His Asp Phe Ser Glu Glu Arg Val Lys Asn Gly Leu Glu
305 310 315 320

Arg Leu Lys Lys Ala Ile Lys Ser Gly Lys Gln Ser Thr Leu Glu Ser
325 330 335

Trp Phe Lys Arg
340



<210> 28 

<211> 326 

<212> PRT 

<213> Methanococcus jannaschii 
<400> 28

Met Gly Val Gln Phe Gly Asp Phe Ile Pro Lys Asn Ile Ile Ser Phe
1 5 10 15

Glu Asp Leu Lys Gly Lys Lys Val Ala Ile Asp Gly Met Asn Ala Leu
20 25 30

Tyr Gln Phe Leu Thr Ser Ile Arg Leu Arg Asp Gly Ser Pro Leu Arg
35 40 45

Asn Arg Lys Gly Glu Ile Thr Ser Ala Tyr Asn Gly Val Phe Tyr Lys
50 55 60

Thr Ile His Leu Leu Glu Asn Asp Ile Thr Pro Ile Trp Val Phe Asp
65 70 75 80

Gly Glu Pro Pro Lys Leu Lys Glu Lys Thr Arg Lys Val Arg Arg Glu
85 90 95

Met Lys Glu Lys Ala Glu Leu Lys Met Lys Glu Ala Ile Lys Lys Glu
100 105 110

Asp Phe Glu Glu Ala Ala Lys Tyr Ala Lys Arg Val Ser Tyr Leu Thr
115 120 125

Pro Lys Met Val Glu Asn Cys Lys Tyr Leu Leu Ser Leu Met Gly Ile
130 135 140

Pro Tyr Val Glu Ala Pro Ser Glu Gly Glu Ala Gln Ala Ser Tyr Met
145 150 155 160

Ala Lys Lys Gly Asp Val Trp Ala Val Val Ser Gln Asp Tyr Asp Ala
165 170 175

Leu Leu Tyr Gly Ala Pro Arg Val Val Arg Asn Leu Thr Thr Thr Lys
180 185 190

Glu Met Pro Glu Leu Ile Glu Leu Asn Glu Val Leu Glu Asp Leu Arg
195 200 205

Ile Ser Leu Asp Asp Leu Ile Asp Ile Ala Ile Phe Met Gly Thr Asp
210 215 220

Tyr Asn Pro Gly Gly Val Lys Gly Ile Gly Phe Lys Arg Ala Tyr Glu
225 230 235 240

Leu Val Arg Ser Gly Val Ala Lys Asp Val Leu Lys Lys Glu Val Glu
245 250 255

Tyr Tyr Asp Glu Ile Lys Arg Ile Phe Lys Glu Pro Lys Val Thr Asp
260 265 270

Asn Tyr Ser Leu Ser Leu Lys Leu Pro Asp Lys Glu Gly Ile Ile Lys
275 280 285

Phe Leu Val Asp Glu Asn Asp Phe Asn Tyr Asp Arg Val Lys Lys His
290 295 300

Val Asp Lys Leu Tyr Asn Leu Ile Ala Asn Lys Thr Lys Gln Lys Thr
305 310 315 320

Leu Asp Ala Trp Phe Lys
325

<210> 29 
<211> 328 
<212> PRT 

<213> Methanobacterium thermoautotrophicum 
<400> 29 

Met Gly Val Lys Leu Arg Asp Val Val Ser Pro Arg Arg Ile Arg Leu
1 5 10 15

Glu Asp Leu Arg Gly Arg Thr Val Ala Val Asp Ala Ala Asn Thr Leu
20 25 30

Tyr Gln Phe Leu Ser Ser Ile Arg Gln Arg Asp Gly Thr Pro Leu Met
35 40 45

Asp Ser Arg Gly Arg Val Thr Ser His Leu Ser Gly Ile Leu Tyr Arg
50 55 60

Thr Ala Ala Val Met Glu Arg Glu Ile Arg Val Ile Tyr Val Phe Asp
65 70 75 80

Gly Arg Ser His His Leu Lys Gly Glu Thr Val Ser Arg Arg Ala Asp
85 90 95

Ile Arg Lys Lys Ser Glu Val Glu Trp Lys Arg Ala Leu Glu Glu Gly
100 105 110

Asp Ile Asp Arg Ala Arg Lys Tyr Ala Val Arg Ser Ser Arg Met Ser
115 120 125

Ser Glu Ile Leu Glu Ser Ser Lys Arg Leu Leu Glu Leu Leu Gly Ile
130 135 140

Pro Tyr Val Gln Ala Pro Gly Glu Gly Glu Ala Gln Ala Ser Tyr Met
145 150 155 160

Val Lys Met Gly Asp Ala Trp Ala Val Ala Ser Gln Asp Tyr Asp Cys
165 170 175

Leu Leu Phe Gly Ala Pro Arg Val Val Arg Lys Val Thr Leu Ser Gly
180 185 190

Lys Leu Glu Asp Pro His Ile Ile Glu Leu Glu Ser Thr Leu Arg Ala
195 200 205

Leu Ser Ile Ser His Thr Gln Leu Val Asp Met Ala Leu Leu Val Gly
210 215 220

Thr Asp Phe Asn Glu Gly Val Lys Gly Tyr Gly Ala Arg Arg Gly Leu
225 230 235 240

Lys Leu Ile Arg Glu Lys Gly Asp Ile Phe Lys Val Ile Arg Asp Leu
245 250 255

Glu Ala Asp Ile Gly Gly Asp Pro Gln Val Leu Arg Arg Ile Phe Leu
260 265 270

Glu Pro Glu Val Ser Glu Asp Tyr Glu Ile Arg Trp Arg Lys Pro Asp
275 280 285

Val Glu Gly Val Ile Glu Phe Leu Cys Thr Glu His Gly Phe Ser Glu
290 295 300

Asp Arg Val Arg Asp Ala Leu Lys Lys Phe Glu Gly Ala Ser Ser Thr
305 310 315 320

Gln Lys Ser Leu Glu Asp Trp Phe
325

<210> 30 
<211> 336 
<212> PRT 

<213> Archaeoglobus fulgidus 
<400> 30

Met Gly Ala Asp Ile Gly Asp Leu Phe Glu Arg Glu Glu Val Glu Leu
1 5 10 15

Glu Tyr Phe Ser Gly Lys Lys Ile Ala Val Asp Ala Phe Asn Thr Leu
20 25 30

Tyr Gln Phe Ile Ser Ile Ile Arg Gln Pro Asp Gly Thr Pro Leu Lys
35 40 45

Asp Ser Gln Gly Arg Ile Thr Ser His Leu Ser Gly Ile Leu Tyr Arg
50 55 60

Val Ser Asn Met Val Glu Val Gly Ile Arg Pro Val Phe Val Phe Asp
65 70 75 80

Gly Glu Pro Pro Glu Phe Lys Lys Ala Glu Ile Glu Glu Arg Lys Lys
85 90 95

Arg Arg Ala Glu Ala Glu Glu Met Trp Ile Ala Ala Leu Gln Ala Gly
100 105 110

Asp Lys Asp Ala Lys Lys Tyr Ala Gln Ala Ala Gly Arg Val Asp Glu
115 120 125

Tyr Ile Val Asp Ser Ala Lys Thr Leu Leu Ser Tyr Met Gly Ile Pro
130 135 140

Phe Val Asp Ala Pro Ser Glu Gly Glu Ala Gln Ala Ala Tyr Met Ala
145 150 155 160

Ala Lys Gly Asp Val Glu Tyr Thr Gly Ser Gln Asp Tyr Asp Ser Leu
165 170 175

Leu Phe Gly Ser Pro Arg Leu Ala Arg Asn Leu Ala Ile Thr Gly Lys
180 185 190

Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Asp Val Lys Pro Glu Ile
195 200 205

Ile Ile Leu Glu Ser Asn Leu Lys Arg Leu Gly Leu Thr Arg Glu Gln
210 215 220

Leu Ile Asp Ile Ala Ile Leu Val Gly Thr Asp Tyr Asn Glu Gly Val
225 230 235 240

Lys Gly Val Gly Val Lys Lys Ala Leu Asn Tyr Ile Lys Thr Tyr Gly
245 250 255

Asp Ile Phe Arg Ala Leu Lys Ala Leu Lys Val Asn Ile Asp His Val
260 265 270

Glu Glu Ile Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asp Asp Tyr
275 280 285

Arg Ile Glu Phe Arg Glu Pro Asp Phe Glu Lys Ala Ile Glu Phe Leu
290 295 300

Cys Glu Glu His Asp Phe Ser Arg Glu Arg Val Glu Lys Ala Leu Glu
305 310 315 320

Lys Leu Lys Ala Leu Lys Ser Thr Gln Ala Thr Leu Glu Arg Trp Phe
325 330 335

<210> 31 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<220> 

<221> misc_feature 
<222> (18)..(18)

<223> The n at this position can be a, c, t, or g. 
<220> 

<221> misc_feature
<222> (27)..(27)

<223> The n at this position can be a, c, t, or g. 
<400> 31 

atctctagca ctgctgtntt ygayggn 27
<210> 32
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> misc_feature
<222> (22)..(22)
<223> The n at this position can be a, c, t, or g.
<220>
<221> misc_feature
<222> (28)..(28)
<223> The n at this position can be a, c, t, or g.
<400> 32

gatctctagc actgctgarg gngargcnca : 31

<210> 33
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 33

gatctctagc actgctcarg aytaygay 28

<210> 34
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> misc_feature
<222> (20)..(20)
<223> The n at this position can be a, c, t, or g.
<220>
<221> misc_feature
<222> (26)..(26)
<223> The n at this position can be a, c, t, or g.
<400> 34

cttaaggtag gactacytgn gcytcnccyt c 31

<210> 35 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 35 

ttaaggtagg actacytcrt aytcytgrct 30 

<210> 36 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<220> 

<221> misc_feature 

<222> (28)..(28)

<223> The n at this position can be a, c, t, or g. 

<400> 36 

ttaaggtagg actacytcrt aytcytgnga 30 
<210> 37
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> misc_feature
<222> (25)..(25)
<223> The n at this position can be a, c, t, or g.
<220>
<221> misc_feature
<222> (28)..(28)
<223> The n at this position can be a, c, t, or g.
<400> 37

ttaaggtagg actacrttrw artcngtj 30

<210> 38
<211> 16
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 38

gatctctagc actgct 16

<210> 39
<211> 17
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 39

ccttaaggta ggactac 17



<210> 40 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 40 

tatcgcagcg atccacttct cctctgc 27 

<210> 41 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 41 

cttaaacggc aacctgagaa ggcttgg 27

<210> 42 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 42 

ctatctcctt ctgcttgaaa acaggagg 28

<210> 43 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 43 

acaagggaac agctcgtcga tatcgcg 27

<210> 44
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 44 

taacgaattc ggtgcagaca taggcgaact ac 32 
<210> 45 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 45 

cggtgtcgac tcaggaaaac cacctctcaa gcg 33

<210> 46 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 46 

cacaggaaac agaccatggg tgcagacata ggcgaac 37 
<210> 47 
<211> 1017 
<212> DNA 

<213> Archaeoglobus veneficus 
<400> 47 

atgggtgcag acataggcga actactcgag agagaagaag ttgaacttga gtacttctcc 60 
gggagaaaaa tagctattga tgcttttaac actctttacc agttcatatc tatcataagg 120 
caacctgacg gcactccttt gaaggattct cagggtagaa tgacctcaca cctctccggc 180 
atcctgtacc gcgtgtcaaa catgatcgag gttggaatga gacccatttt cgttttcgat 240 
ggtgagcctc ctgttttcaa gcagaaggag atagaggaac gaaaggaaag aagagctgaa 300

gcagaggaga agtggatcgc tgcgatagag agaggagaga agtacgcaaa gaagtacgct 360 

caggcagcgg cgagggttga tgaatacatc gtcgagtcgt caaagaagct gcttgagtat 420 

atgggagttc catgggttca ggcgccgagt gagggagagg cacaggctgc atacatggca 480

gcgaagggcg atgtagattt tactggctcg caggattacg actcgcttct cttcggcagc 540

ccaaagcttg caagaaatct cgcgattact ggaaagagga agctgcccgg aaagaatgtt 600 

tacgttgagg tcaaaccaga gataatagac ttaaacggca acctgagaag gcttggaata 660 

acaagggaac agctcgtcga tatcgcgttg ctcgtgggaa cggactacaa cgaaggagtg 720 

aagggcgttg gggtcaagaa ggcctacaag tacataaaaa cctacggaga tgttttcaaa 780 

gctctcaagg ccttaaaggt agagcaggag aacatagagg agataagaaa cttcttcctg 840 

aacccgcctg ttacgaacaa ctacagcctc cacttcggaa agccagacga tgagaagatt 900 

atcgagttcc tgtgtgaaga gcacgacttt agcaaggata gggtagagaa ggccgttgag 960 

aagctgaaag caggaatgca agcctcgcaa tcaacgcttg agaggtggtt ttcctga 1017 
<210> 48 
<211> 338 
<212> PRT 

<213> Archaeoglobus veneficus 
<400> 48 

Met Gly Ala Asp Ile Gly Glu Leu Leu Glu Arg Glu Glu Val Glu Leu
1 5 10 15

Glu Tyr Phe Ser Gly Arg Lys Ile Ala Ile Asp Ala Phe Asn Thr Leu
20 25 30

Tyr Gln Phe Ile Ser Ile Ile Arg Gln Pro Asp Gly Thr Pro Leu Lys
35 40 45

Asp Ser Gln Gly Arg Met Thr Ser His Leu Ser Gly Ile Leu Tyr Arg
50 55 60

Val Ser Asn Met Ile Glu Val Gly Met Arg Pro Ile Phe Val Phe Asp
65 70 75 80

Gly Glu Pro Pro Val Phe Lys Gln Lys Glu Ile Glu Glu Arg Lys Glu
85 90 95

Arg Arg Ala Glu Ala Glu Glu Lys Trp Ile Ala Ala Ile Glu Arg Gly
100 105 110

Glu Lys Tyr Ala Lys Lys Tyr Ala Gln Ala Ala Ala Arg Val Asp Glu
115 120 125

Tyr Ile Val Glu Ser Ser Lys Lys Leu Leu Glu Tyr Met Gly Val Pro
130 135 140

Trp Val Gln Ala Pro Ser Glu Gly Glu Ala Gln Ala Ala Tyr Met Ala
145 150 155 160

Ala Lys Gly Asp Val Asp Phe Thr Gly Ser Gln Asp Tyr Asp Ser Leu
165 170 175

Leu Phe Gly Ser Pro Lys Leu Ala Arg Asn Leu Ala Ile Thr Gly Lys
180 185 190

Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Glu Val Lys Pro Glu Ile
195 200 205

Ile Asp Leu Asn Gly Asn Leu Arg Arg Leu Gly Ile Thr Arg Glu Gln
210 215 220

Leu Val Asp Ile Ala Leu Leu Val Gly Thr Asp Tyr Asn Glu Gly Val
225 230 235 240

Lys Gly Val Gly Val Lys Lys Ala Tyr Lys Tyr Ile Lys Thr Tyr Gly
245 250 255

Asp Val Phe Lys Ala Leu Lys Ala Leu Lys Val Glu Gln Glu Asn Ile
260 265 270

Glu Glu Ile Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asn Asn Tyr
275 280 285

Ser Leu His Phe Gly Lys Pro Asp Asp Glu Lys Ile Ile Glu Phe Leu
290 295 300

Cys Glu Glu His Asp Phe Ser Lys Asp Arg Val Glu Lys Ala Val Glu
305 310 315 320

Lys Leu Lys Ala Gly Met Gln Ala Ser Gln Ser Thr Leu Glu Arg Trp
325 330 335

Phe Ser
<210> 49

<211> 53 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 49 

cccgtctcgc tggtgaaaag aaaaaccacc ctggcgccca atacgcaaac cgc 53 

<210> 50 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<220> 

<221> misc_feature

<222> (1)..(1)

<223> The residue at this position is linked to a spacer containing either a Cy3 or fluorescein group.

<220>

<221> misc_feature

<222> (1)..(2)

<223> The T residues at these positions have amino-modifiers.
<220>

<221> misc_feature

<222> (22)..(22)

<223> The residue at this position is linked to a spacer containing either a Cy3 or fluorescein group.

<400> 50 

ttccagagcc taatttgcca gt 22 
<210> 51

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<220> 

<221> misc_feature

<222> (1)..(1)

<223> The residue at this position is linked to a spacer containing either a Cy3 or fluorescein group.

<220>

<221> misc_feature

<222> (1)..(2)

<223> The T residues at these positions have amino-modifiers.
<220>

<221> misc_feature

<222> (22)..(22)

<223> The residue at this position is linked to a spacer containing either a Cy3 or fluorescein group.

<400> 51 

ttccagagcc taatttgcca gt 22 

<210> 52 

<211> 23

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<220> 

<221> misc_feature 

<222> (1)..(1)

<223> The residue at this position is linked to a spacer containing either a TET or fluorescein group.

<400> 52

ttccagagcc taatttgcca gta 23 



<210> 53

<211> 23

<212> DNA

<213> Artificial Sequence
<220>

<223> Synthetic
<220>

<221> misc_feature

<222> (1)..(1)

<223> The residue at this position is linked to a spacer containing either a TET or fluorescein group.

<400> 53

ttccagagcc taatttgcca gta 23

<210> 54

<211> 25

<212> DNA

<213> Artificial Sequence
<220>

<223> Synthetic

<400> 54

cttaccaacg ctaacgagcg tcttg 25

<210> 55

<211> 14

<212> DNA

<213> Artificial Sequence
<220> 

<223> Synthetic 
<400> 55 

gctcccgcag acac 14 
<210> 56

<211> 14 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<220> 

<221> misc_feature

<222> (1)..(1)

<223> The residue at this position is linked to a spacer containing a Cy3 group.

<220>

<221> misc_feature

<222> (1)..(2)

<223> The residues at these positions have amino modifications. 

<400> 56 

ttacgccacc agct 14 

<210> 57 

<211> 12 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 57 

cgctgtctcg ct 12 

<210> 58 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 58 

gctcaaggca ctcttgccc 19 
<210> 59

<211> 63 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 59 

atgactgaat ataaacttgt ggtagttgga gctggtggcg taggcaagag tgccttgacg 60 
ata 63 

<210> 60 

<211> 45 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 60 

ttttttttta attaggctct ggaaagacgc tcgtgaaacg agcgt 45 

<210> 61 

<211> 14 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 61 

cttcggagtt tggg 14 

<210> 62 

<211> 14 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 62 

cttcggagtt tggg 14 
<210> 63

<211> 14 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 63 

cttcggagtt tggg 14 

<210> 64 

<211> 14 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 64 

cttcggagtt tggg 14 

<210> 65 

<211> 14 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 65 

cttcggagtt tggg 14 

<210> 66 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 66 

gggttgtgga gtgagtgttc aagta 25 
<210> 67
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 67 

ccatcctaat acgactcact atagggc 27 
<210> 68 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 68 

ctcatacagt tacttgtctt c 21 

<210> 69 

<211> 489 

<212> RNA 

<213> Homo sapiens 

<400> 69 

gaacucacua uagggcucga gcggccgccc gggcaggucc gccaccaaaa ugcagauuuu 60 
cgugaaaacc cuuacgggga agaccaucac ccucgagguu gaacccucgg auacgauaga 120
aaauguaaag gccaagaucc aggauaagga aggaauuccu ccugaucagc agagacugau 180 
cuuugcuggc aagcagcugg aagauggacg uacuuugucu gacuacaaua uucaaaagga 240 
gucuacucuu caucuugugu ugagacuucg ugguggugcu aagaaaagga agaagaaguc 300 
uuacaccacu cccaagaaga auaagcacaa gagaaagaag guuaagcugg cuguccugaa 360 
auauuauaag guggaugaga auggcaaaau uagucgccuu cgucgagagu gcccuucuga 420 
ugaauguggu gcuggggugu uuauggcaag ucacuuugac agacauuauu guggcaaaug 480 
uugucugac 489
<210> 70
<211> 52 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 70 

ggaatacgac tcactatagg gaaagtctct gccgcccttc tgtgcctgct gc 52 
<210> 71 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 71 

aatagttaca aaatattcat ttccacaata a 31 
<210> 72 
<211> 647 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 72 

gggaaagucu cugccgcccu ucugugccug cugcucauag cagccaccuu cauuccccaa 60 
gggcucgcuc agccagaugc aaucaaugcc ccagucaccu gcuguuauaa cuucaccaau 120
aggaagaucu cagugcagag gcucgcgagc uauagaagaa ucaccagcag caaguguccc 180 
aaagaagcug ugaucuucaa gaccauugug gccaaggaga ucugugcuga ccccaagcag 240 
aaguggguuc aggauuccau ggaccaccug gacaagcaaa cccaaacucc gaagacuuga 300 
acacucacuc cacaacccaa gaaucugcag cuaacuuauu uuccccuagc uuuccccaga 360 
cacccuguuu uauuuuauua uaaugaauuu uguuuguuga ugugaaacau uaugccuuaa 420 
guaauguuaa uucuuauuua aguuauugau guuuuaaguu uaucuuucau gguacuagug 480
uuuuuuagau acagagacuu ggggaaauug cuuuuccucu ugaaccacag uucuaccccu 540
gggauguuuu gagggucuuu gcaagaauca uuaauacaaa gaauuuuuuu uaacauucca 600 
augcauugcu aaaauauuau uguggaaaug aauauuuugu aacuauu 647 
<210> 73 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 73 

ttcttcggag tttggg 16 
<210> 74 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic
<400> 74 

ccgtcacgcc tccttcggag tttggg 26 
<210> 75 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic
<400> 75 

aacccaaact ccgaaggagg cgtg 24 
<210> 76 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 76

gcgcagtgag aatgaggagg cgtgacggt 29 



<210> 77

<211> 15

<212> DNA

<213> Artificial Sequence
<220>

<223> Synthetic
<220>

<221> misc_feature

<222> (1)..(1)

<223> The residue at this position is linked to a spacer containing a Cy3 group.

<400> 77

ctcattctca gtgcg 15

<210> 78

<211> 30

<212> DNA

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 78 

aacgaggcgc acctttacat tttctatcgt 30 

<210> 79 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 79 

ccttccttat cctggatctt ggca 24 
<210> 80

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 80 

acgatagaaa atgtaaaggt gcgc 24 

<210> 81 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 81 

cggaagaagc aagtggtgcg cctcgttaa 29

<210> 82

<211> 14

<212> DNA

<213> Artificial Sequence
<220>

<223> Synthetic
<220>

<221> misc_feature

<222> (1)..(1)

<223> The residue at this position is linked to a spacer containing a Cy3 group.

<400> 82

cacttgcttc ctcc 14

<210> 83

<211> 14 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 83 

gctcccgcag acac 14 

<210> 84 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 84 

caaagaaaag ctgcgtgatg atgaaatcgc 30

<210> 85 

<211> 50 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 85 

gaaggtgtct gcgggagccg atttcatcat cacgcagctt ttctttgagg 50 



